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(57) ABSTRACT 

An ISYNC instruction does not cause a flush of specula- 
tively dispatched or fetched instructions (instructions that 
are dispatched or fetched after the ISYNC instruction) 
unconditionally. The present invention detects the occur- 
rence of any instruction that changes the state of the m achine 
and requires a context synchronizing complete; these 
instructions are called context-synchronizing-required 
instructions. When a context-synchronizing-required 
instruction completes, the present invention sets a flag to 
note the occurrence of that condition. When an ISYNC 
instruction completes, the present invention causes a flush 
and refetches the instruction after the ISYNC if the context- 
synchronizing-required flag is active. The present invention 
then resets the context-synchronizing-required flag. If the 
context-synchronizing-required flag is not active, then the 
present invention does not generate a flush operation. 
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SYSTEM AND METHOD FOR HANDLING instruction has executed completely (its data has been 

INSTRUCTIONS OCCURRING AFTER AN fetched and returned). Prior implementation of the isync 

ISYNC INSTRUCTION cause a flush and refetch of the instruction after the isync 

thus ensuring that the Id instruction will execute after the 

CROSS-REFERENCE TO RELATED 5 larwx instruction. 

APPLICATIONS This type of usage is implemented often, and removing 
This patent application is related to U.S. patent applica- ,he A^h penalty can improve the performance of the po- 
tion Ser. No. 08/893,070, filed Jul. 15, 1997, which is hereby cessor Therefore, there is a need in the art for improving 
incorporated by reference herein. "P°° ' he lush Penalty associated with the execution of 

10 ISYNC instructions. 



TECHNICAL FIELD 



SUMMARY OF THE INVENTION 



The present invention relates in general to data processing 

systems, and in particular, to the execution of machine In lne present invention, an ISYNC instruction does not 

context synchronization operations. cause a flush of speculatively dispatched or fetched instruc- 
tions (instructions that are dispatched or fetched after the 

BACKGROUND INFORMATION ISYNC instruction) unconditionally. The present invention 

_ . ... . , detects the occurrence of any instruction that changes the 

Within certain processor architectures, memory synchro- j • .. u •• 

... .... .... . . . • l state of the machine and requires a context synchronizing 

nization instructions control the order in which memory , , .. 1. ., } . , ° 

. . , ' complete; these instructions are called context- 
operations are completed with respect to asynchronous 20 I ■ ■ ■ j • . .• «,u . . 

r , ... , • . • . .■ synchromzing-required instructions. When a context- 
events, and the order in which memory operations are seen ' , . . 0 . , . . , , ., 
... u • a synchromzing-required instruction completes, the present 
by other processors or memory access mechanisms. An . .. . „ . . r , ' . r .... 

. . . r , /.nv.,™ '. . .. ., invention sets a Dag to note the occurrence of that condition; 

instruction synchronize (ISYNC) instruction causes the pro- ., . _ . ° . .. . .. • j 

. j- j 11 r . i_ j • . .• : r this nag is referred to as a context-synchronizine-required 

cessor to discard all prefetched instructions, wait for any „ .... . . . , 

....... , . , .1 1 . . flag. When an ISYNC instruction completes afterward, the 

preceding instructions to complete, and then branch to the 25 . • .• a u j c . t. .■_ 

r . 0 .... .. , Z- . L .. a . c , • present invention causes a flush and refetches the mstmc- 

□ext sequential instruction (which has the effect of clearing f. ,, ., .„.„.„ ... ., . , , . . . , 

,. ... ...... . . .. > r, t _i_ tions after the ISYNC if the context-synchronizing-required 

the pipeline behind the ISYNC instruction). For further „ .. . . ./ . ., . _. 

,. rr . .. c . „_ flag is active. The present invention then resets the context- 
discussion of the ISYNC instruction, refer to PowerPC 603 . . , „ T , ., . . ... 

RISC Microprocessor User's Manual, Copyright 1994, synchronizing-required flag. If the contex -synchronizing- 

... ... .. . „ »» l- /- required flag is not active, then the present invention does 

Motorola, Inc. and International Business Machines Corp. , 30 . . a u '. -~ r . • .. .. 

a -m a -71 c ~,a j ,, 01 j un n^'c . not generate a flush operation. The present mvention thus 

pp. 4-7 1, 4-72, 5-24 and H-83, and PowerPC System , " c r r ., „ , \. ... .... 

. , : ' t ol i /-> • u. inncu ji reduces the frequency of the flush action significantly, thus 

Architecture, Tom Shanley, Copyright 1995 by MindShare, . • .u _c r 

. ,. ... t . • . j u <• improving the performance of the processor. 

Inc. , p. 113, which are hereby incorporated by reference r 0 r r 

nerem ' " ' Furthermore, the context-synchronizing-required flag is 

o_ • „_ ,„»_.. i-,- 11.. a.„u.j .11 ,c not set when the context-synchronizing-required instruction 

Prior art processors unconditionally flushed all specula- 35._,. ,., j.i - , , j- 

tive instructions that were dispatched or fetched after an I s dls P*! ched - In f ad - is set when the corresponding 

ISYNC instruction. "Flush" refers to the clearing of the completes. This implementation solves the prob- 

processor of instructions. This unconditional flush action lem of s P<™laUve context-synchromzmg-rcqun-ed instruc- 

resulted in flushing and restarting the instruction stream t ' ons ^ thn 8 the context-synchronizing-required flag and 

several times for every one thousand instructions. The more 40 n m &' 

speculative the design allows, the more performance loss The foregoing has outlined rather broadly the features and 

this flush action caused. Furthermore, such flush actions can technical advantages of the present invention in order that 

result in an average of over a dozen cycles in penalty for lDe detailed description of the invention that follows may be 

subsequently re-retrieving the instructions. ' bettcr understood. Additional features and advantages of the 

The ISYNC instruction is often used as a barrier in lock « invention will be described hereinafter which form the 

sequences. One example is illustrated below using PowerPC sub J ecl of the clauns of the lnventl0n - 

instructions. BRIEF DESCRIPTION OF THE DRAWINGS 

For a more complete understanding of the present 
50 invention, and the advantages thereof, reference is now 
made to the following descriptions taken in conjunction with 
the accompanying drawings, in which: 

FIG. 1 illustrates a data processing system configurable in 
accordance with the present invention; 

FIG. 2 illustrates a processor configurable in accordance 
with the present invention; and 

In this example, a critical section is protected and a process FIG 3 illustrates a flow diagram of an embodiment of the 

can enter only if it obtains a lock. The larwx instruction present invention, 

establishes a reservation (the lock to the critical section), the 60 .... 
stcwx instruction checks if the reservation is obtained, the 

bne instruction branches back to the larwx instruction if the In the following description, numerous specific details are 

reservation is not obtained. If the reservation is obtained, the set forth such as specific word or byte lengths, etc. to provide 

branch will be unsuccessful and this process is allowed to a thorough understanding of the present invention. However, 

proceed by executing the isync and subsequent instructions, 65 it will be obvious to those skilled in the art that the present 

which include a Id instruction. The isync is needed to ensure invention may be practiced without such specific details. In 

that the Id instruction can only be executed after the larwx other instances, well-known circuits have been shown in 
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Id 
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block diagram form in order not to obscure the present cessor 210 and other devices coupled to system bus 211, 

invention in unnecessary detail. For the most part, details such as main memory (RAM) 250 and nonvolatile mass 

concerning timing considerations and the like have been storage 252, by participating in bus arbitration. The data 

omitted inasmuch as such details are not necessary to obtain processing system illustrated in FIG. 2 may include other 

a complete understanding of the present invention and are 5 unillustrated devices coupled to system bus 211, which are 

within the skills of persons of ordinary skill in the relevant 001 necessary for an understanding of the following descrip- 

arl " tion and are accordingly omitted for the sake of simplicity. 

n( . i , . . , . , . . . BIU 212 is connected to instruction cache and MMU 

Refer now to the drawings wherein depicted elements are ... ,. . .. ... . , . . , j». wi , 

... . " , • •■ (Memory Management Unit) 214 and data cache and MMU 

not necessarily shown to scale and wherein like or similar \ u ^ cp 5 2W Hi h d cacn such as lhose 

elements are designated by the same reference numeral w w j tDm instruction cache and MMU 214 and data cache and 

through the several views. MMU 216, enable CPU 210 to achieve relatively fast access 

Referring first to FIG. 1, an example is shown of a data times to a subset of data or instructions previously trans- 
processing system configurable in accordance with the ferred from main memory 250 to the caches, thus improving 
present invention. The system has a central processing unit the speed of operation of the data processing system. Data 
("CPU") 210, such as a PowerPC microprocessor 15 and instructions stored within the data cache and instruction 
("PowerPC is a trademark of IBM Corporation) according cache, respectively, are identified and accessed by address 
to the "PowerPC Architecture: A Specification for a New tags, which each comprise a selected number of high-order 
Family of RISC Processors," 2d edition, 1994, Cathy May, b«s of the physical address of the data or instructions in 
et al. Ed., which is hereby incorporated by reference herein. main memory 250. Instruction cache and MMU 214 is 
A more specific implementation of a PowerPC micropro- 20 ^ nh ^ coupled to sequential fetcher 217, which fetches 
cessor is described in the "PowerPC 604 RISC Micropro- '™f«»f°™ execution from instruction cache and MMU 
cessor User's Manual," 1994, IBM Corporation, which is "4 during each cycle. Sequential fetcher 217 transmits 
... . j u t l ■ branch instructions fetched from instruction cache and 

hereby incorporated by reference herein. MMU 214 to branch processing unit ("BPU") 218 for 

The CPU 210 is coupled to various other components by 25 execution, but temporarily stores sequential instructions 
system bus 211. Read only memory ("ROM") 116 is coupled w j tD i D instruction queue 219 for execution by other execu- 
to the system bus 211 and includes a basic input/output tion circuitry within CPU 210. 

system ("BIOS"), which controls certain basic functions of i n the depicted illustrative embodiment, in addition to 
the data processing system. Random access memory gpij 218, the execution circuitry of CPU 210 comprises 
("RAM") 250, I/O adapter 118, and communications adapter 3Q multiple execution units for executing sequential 
134 are also coupled to the system bus 211 . I/O adapter 118 instructions, including fixed-point unit ("FXU") 222, load/ 
may be a small computer system interface ("SCSI") adapter st0 re unit ("LSU") 228, and floating-point unit ("FPU") 230. 
that communicates with a disk storage device 120 or tape Each of execution units 222, 228 and 230 typically executes 
storage drive 140. I/O adapter 118, disk storage device 120, one or more instructions of a particular type of sequential 
and tape storage device 140 are also referred to herein as J5 instructions during each processor cycle. For example, FXU 
mass storage 252. Communications adapter 134 intercon- 222 performs fixed-point mathematical and logical opera- 
nects bus 211 with an outside network enabling the data t ions such as addition, subtraction, ANDing, ORing, and 
processing system to communicate with other such systems. XORing, utilizing source operands received from specified 
Input/output devices are also connected to system bus 211 general purpose registers ("GPRs") 232. Following the 
via user interface adapter 122 and display adapter 136. 4Q execution of a fixed-point instruction, FXU 222 outputs the 
Keyboard 124, trackball 132, mouse 126, and speaker 128 d ata results of the instruction to GPR buffers 232, which 
are all interconnected to bus 211 via user interface adapter provide storage for the result received on result bus 262. 
122. Display monitor 138 is connected to system bus 211 by Conversely, FPU 230 typically performs single and double- 
display adapter 136. In this manner, a user is capable of precision floating-point arithmetic and logical operations, 
inputting to the system through the keyboard 124, trackball 45 sucn as floating-point multiplication and division, on source 
132, or mouse 126 and receiving output from the system via operands received from floating-point registers ("FPRs") 
speaker 128 and display 138. Additionally, an operating 236. FPU 230 outputs data resulting from the execution of 
system such as AIX ("AIX" is a trademark of the IBM floating-point instructions to selected FPR buffers 236, 
Corporation) is used to coordinate the functions of the WD j ca store the result data. As its name implies, LSU 228 
various components shown in FIG. 1. 50 typically executes floating-point and fixed-point instructions 

It should be noted that the data processing system con- which either load data from memory (i.e., either the data 
figured in accordance with the present invention may be a cache within data cache and MMU 216 or main memory 
multi-processing system including processors 101 and 102, 250) into selected GPRs 232 or FPRs 236 or which store 
in addition to processor 210, coupled to system bus 211. data from a selected one of GPRs 232 or FPRs 236 to 

With reference now to FIG. 2, there is depicted a block 55 memory 250. 
diagram of an illustrative embodiment of a data processing CPU 210 employs both pipelining and out-of-order 
system for processing information in accordance with the execution of instructions to further improve the performance 
invention recited within the appended claims. In the of its superscalar architecture. Accordingly, instructions can 
depicted illustrative embodiment, CPU 210 comprises a be executed by FXU 222, LSU 228, and FPU 230 in any 
single integrated circuit superscalar microprocessor. 60 order as long as data dependencies are observed. In addition, 
Accordingly, as discussed further below, CPU 210 includes instructions are processed by each of FXU 222, LSU 228, 
various execution units, registers, buffers, memories, and and FPU 230 at a sequence of pipeline stages. As is typical 
other functional units, which are all formed by integrated of high-performance processors, each sequential instruction 
circuitry. CPU 210 is coupled to system bus 211 via bus is processed at five distinct pipeline stages, namely, fetch, 
interface unit (BIU) 212 and processor bus 213, which like 65 decode/dispatch, execute, finish, and completion, 
system bus 211 includes address, data, and control buses. During the fetch stage, sequential fetcher 217 retrieves 
BIU 212 controls the transfer of information between pro- one or more instructions associated with one or more 
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memory addresses from instruction cache and MMU 214. lions are instructions whose execution may cause (1) the 

Sequential instructions fetched from instruction cache and re-dispatch of subsequent instructions that have already been 

MMU 214 are stored by sequential fetcher 1217 within dispatched, or (2) the abandonment of subsequent instruc- 

instruction queue 219. In contrast, sequential fetcher 217 lions that have already been dispatched. The CSI instruc- 

removes (folds out) branch instructions from the instruction 5 lions fall into this category. Therefore, all CSI instructions 

stream and forwards them to BPU 218 for execution. BPU are qualified for the GCT. 

218 includes a branch prediction mechanism, which in one When the execution of an interruptible instruction causes 

embodiment comprises a dynamic prediction mechanism redispatch and abandonment of subsequent instructions, 

such as a branch history table. That enables BPU 218 to tnen us execution "flushes" the subsequent instructions, 

speculatively execute unresolved conditional branch instruc- to , n ^ M ocessors> m uncond i tiona i flush 0 f all 

Hons by predicting whether or not the branch will be taken. speculative instructions that were dispatched or fetched after 

During the decode/dispatch stage, dispatch unit 220 an ISYNC instruction was performed. In the present 
decodes and dispatches one or more instructions from invention, an ISYNC instruction does not cause a flush of 
instruction queue 219 to execution units 222, 228, and 230, speculatively dispatched or fetched instructions (instructions 
typically in program order. In a more conventional 15 that are dispatched or fetched after the ISYNC instruction) 
processor, dispatch unit 220 allocates a rename buffer within unconditionally. The present invention detects the occur- 
GPR rename buffers 233 or FPR rename buffers 237 for each rence 0 f any instruction that changes the state of the machine 
dispatched instruction's result data, and at dispatch, instruc- an( j requires a context synchronizing complete; these 
tions are also stored within the multiple-slot completion instructions are called context-synchronizing-required 
bufferof completion unit 240 to await completion. However, 20 instructions. When a context-synchronizing-required 
the present invention is adaptable to embodiments which instruction completes, the present invention sets a (lag to 
require neither rename registers or completion units. note the occurrence of that condition; this flag is referred to 
According to the depicted illustrative embodiment, CPU 210 as a context-synchronizing-required flag. When an ISYNC 
tracks the program order of the dispatched instructions instruction completes, the present invention causes a flush 
during out-of-order execution utilizing unique instruction 25 an d refetches the instructions after the ISYNC if the context- 
identifiers, synchronizing-required flag is active. The present invention 

During the execute stage, execution units 222, 228, and then resets the context-synchronizing-required flag. If the 

230 execute instructions received from dispatch unit 220 context-synchronizing-required flag is not active, then the 

opportunistically as operands and execution resources for present invention does not generate a flush operation. The 

the indicated operations become available. In one 30 present invention thus reduces the frequency of the flush 

embodiment, each of execution units 222, 228, and 230 are action significantly, thus improving the performance of the 

equipped with a reservation station that stores instructions processor. 

dispatched to that execution unit until operands or execution Furthermore, the context-synchronizing-required flag is 

resources become available. After execution of an instruc- ^ n ot set when the context-synchronizing-required instruction 

tion has terminated, execution units 222, 228, and 230 store ^ dispatched. Instead, the flag is set when the corresponding 

data results, if any, within either GPRs or FPRs, depending instruction completes. This implementation solves the prob- 

upon the instruction type. In more conventional processors, i em 0 f speculative context-synchronizing-required instruc- 

execution units 222, 228, and 230 notify completion unit t j ons setting the context-synchronizing-required flag and 

240 which instructions have finished execution. Finally, t nen aborting. 

instructions are completed in program order out of the 40 A , of regisler 2 90 could store the context- 
completion buffer of completion unit 240. Instructions synchronizing-required flag (CSR flag) and be monitored by 
executed by FXU 222 and FPU 230 are completed by me flow di iUu strated in FIG. 3 described below, 
transferring data results of the instructions from GPR , . ... 

rename buffers 233 and FPR rename buffers 237 to GPRs W [ th res P ect l ? ' he P resent lnv , e "' 10n ' a GOUt ^ m 

232 and FPRs 236 respectively 45 synchronizing-required instruction could be any one of the 

' . .... following, which are described in detail in the "PowerPC 

However, in various embodiments, the invention utilizes Arch itectwe: A Specification [or a New Family of RISC 

the dispatch logic of the processor to "tokenize" a classical Processors » referenced previously: 

Von Neumann instruction stream into a data flow-style . . . ... . . . 

format. Thus, data dependencies are not handled by tracking m icbl Option cache block invalidate): this instruction 

the storage location of source data required by each 50 "^validates a block of instructions m the instruction 

instruction, as in register renaming, but rather by associating cacne. 

with an instruction certain information which enables track- mtmsr, mtmsrd (move to MSR and move to MSR double): 

ing source data by reference to another instruction which is m ° ve the content of a GPR to the MSR. 

to provide the source data. Accordingly, the processor is 5J mtspr (move to SPR) ASR, SDR1, HID, MMCR, PMC, 

provided with an instruction tag ("ITAG") generator which DABR, 1ABR, ACCR: move the content of a GPR to 

generates tokens, or tags, each of which is uniquely asso- the SPR. In particular, the ASR, SDR1, HID, MMCR, 

ciated with an instruction upon dispatch. The ITAGs are PMC, DABR, IABR, and ACCR are specific SPRs that 

used to retain program order information. contain a special state of Ihe machine. 

The dispatch unit 220 in the present invention not only 60 Changing them requires a context-synchronizing inslruc- 

assigns ITAGs and dispatches instructions, but also updates llon - 

various tables which are used to track the status of the slbie (invalidate segment lookaside table entry): invali- 

dispatched instructions. date a segment entry in the segment lookaside table. 

Completion unit 240 includes a Group Completion Table tlbie (invalidate page lookaside table entry): invalidate a 

("GCT"), which is a first-in-first-out buffer that saves rel- 65 page entry in the page lookaside table, 

evant information of interruptible instructions that have been mtsr, mtsrin (move to SR, move to SR indirect): move Ihe 

dispatched but not yet completed. The interruptible instruc- content of a GPR to the SR. 
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The present invention is described in further detail in FIG. 
3. In step 301, instructions are completed using the comple- 
tion unit 240. In step 302, a determination is made whether 
the instruction next to complete is a context-synchronizing- 
required instruction. If not, the process proceeds to step 304. 5 
However, if in step 302, the instruction next to complete is 
a context-synchronizing-required instruction, then the 
context-synchronizing-required flag is set in step 303. Sev- 
eral instructions may complete before or after the context- 
synchronizing-required instruction. However, for the sake of JQ 
clarity, this is not further illustrated in FIG. 3. 

The CSI flag can also be reset by a Bush of completing 
instruction that is different than the ISYNC. This occurs for 
the following case: 

CSI instruction completes (step 302), then set CSI flag 
(step 303), 

other instruction (not the ISYNC) completes which causes 
a flush (the instruction may require a flush itself or it 
encounters an interrupt condition) (step 304). 
Since the flush operation occurs, the CSI is reset to 20 
prevent an unnecessary flush when the ISYNC is eventually 
refetched and completed. If a completing instruction does 
not cause a flush, the process merely proceeds from step 303 
to step 306. 

Thereafter, an ISYNC instruction will move down the 25 
completion unit pipeline to be completed in step 306. In step 

307, the present invention determines if the context- 
synchronizing-required flag has been set. If yes, then in step 

308, a flush is performed of all speculatively dispatched or 
fetched instructions after the ISYNC instruction. The 30 
instruction after the ISYNC is then refetched. Then in step 

309, the context-synchronizing-required flag is cleared, or 
reset. The process then continues with the completion of the 
refetched instructions (step 301). 

A novelty of the present invention is that when the 35 
context-synchronizing-required flag is not set when an 
ISYNC instruction is completed, the flush operation in step 
308 is not performed. Instead, the process proceeds from 
step 307 to step 301 to the continuing of the completion of 
instructions. This implementation, as noted above, reduces 40 
the frequency of the flush action significantly improving the 
performance of the processor. 

To ensure the order of larwx and subsequent load 
instructions, a scoreboard mechanism is implemented on the 
ISYNC instruction. The ISYNC instruction is held from 45 
being dispatched until all prior load instructions have 
executed. Note that the present invention chooses to hold the 
ISYNC instruction at the dispatch stage. Other implemen- 
tations may decide to hold the ISYNC instruction at other 
stages as long as it ensures that the ISYNC instruction 50 
prevents subsequent load instructions from executing before 
prior load instructions have executed. 

Although the present invention and its advantages have 
been described in detail, it should be understood that various 
changes, substitutions and alterations can be made herein 55 
without departing from the spirit and scope of the invention 
as defined by the appended claims. 

What is claimed is: 

1. A method for completing instructions comprising the 
steps of: 60 

determining if an instruction to complete is a context- 
synchronizing-required instruction; 

completing an isync instruction subsequent to completion 
of the instruction; and 

not flushing instructions fetched subsequent to the isync 65 
instruction if the instruction to complete is not a 
context-synchronizing-required instruction . 
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2. The method as recited in claim 1, further comprising 
the step of: 

flushing instructions fetched subsequent to the isync 
instruction if the instruction to complete is a context- 
synchronizing-required instruction. 

3. The method as recited in claim 2, further comprising 
the step of: 

setting a context-synchronizing-required flag if the 
instruction to complete is a context-synchronizing- 
required instruction, wherein the flushing step is per- 
formed if the flag is set. 

4. The method as recited in claim 3, further comprising 
the step of: 

resetting the context-synchronizing-required flag after the 
flushing step. 

5. The method as recited in claim 4, further comprising 
the step of: 

if the instruction to complete causes a flush, resetting the 
flag. 

6. The method as recited in claim 1, further comprising 
the step of: 

if the instruction to complete causes a flush, not flushing 
instructions fetched subsequent to the isync instruction. 

7. A processor comprising: 

a fetcher operable for fetching instructions from memory; 
one or more execution units operable for executing the 
instructions; 

a dispatch unit operable for dispatching the instructions to 

the one or more execution units; 
a completion unit operable for completing executed 

instructions; 

circuitry operable for determining if an instruction in the 
completion unit is a context-synchronizing-required 
instruction; 

circuitry in the completion unit operable for completing 
an isync instruction subsequent to completion of the 
instruction; and 

circuitry operable for not flushing instructions fetched 
subsequent to the isync instruction if the instruction to 
complete is not a context-synchronizing-required 
instruction. 

8. The processor as recited in claim 7, further comprising: 
circuitry operable for flushing instructions fetched subse- 
quent to the isync instruction if the instruction to 
complete is a context-synchronizing-required instruc- 
tion. 

9. The processor as recited in claim 8, further comprising: 
a context-synchronizing-required flag; and 

circuitry operable for setting the context-synchronizing- 
required flag if the instruction to complete is a context- 
synchronizing-required instruction, wherein the flush- 
ing of the instructions is performed if the flag is set. 

10. The processor as recited in claim 9, further compris- 
ing: 

circuitry operable for setting the context-synchronizing- 
required flag after the flushing of the instructions. 

11. The processor as recited in claim 10, further compris- 
ing: 

if the instruction to complete causes a flush, circuitry 
operable for resetting the flag. 

12. The processor as recited in claim 7, further compris- 
ing: 

if the instruction to complete causes a flush, circuitry 
operable for not flushing instructions fetched subse- 
quent to the isync instruction. 
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13. A computer program product adaptable for storage on 
a computer readable medium, the computer program product 
operable for completing instructions comprising the pro- 
gram steps of: 

determining if an instruction to complete is a context- 

synchronizing-required instruction; 
completing an isync instruction subsequent to completion 

of the instruction; and 
not flushing instructions fetched subsequent to the isync 

instruction if the instruction to complete is not a 

context-synchronizing-required instruction. 

14. The computer program product as recited in claim 13, 
further comprising the step of: 

flushing instructions fetched subsequent to the isync 
instruction if the instruction to complete is a context- 
synchronizing-required instruction. 

15. The computer program product as recited in claim 14, 
further comprising the program step of: 
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setting a context-synchronizing-required flag if the 
instruction to complete is a context-synchronizing- 
required instruction, wherein the flushing program step 
is performed if the flag is set. 

16. The computer program product as recited in claim 15, 
further comprising the program step of: 

resetting the context-synchronizing-required flag after the 
flushing program step. 

17. The computer program product as recited in claim 16, 
1 further comprising the program step of: 

if the instruction to complete causes a flush, resetting the 
flag. 

18. The computer program product as recited in claim 13, 
further comprising the program step of: 

if the instruction to complete causes a flush, not flushing 
instructions fetched subsequent to the isync instruction. 
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It is certified that error appears in the above-identified patent and that said Letters Patent is 
hereby corrected as shown below: 



Title page. Item [54] and Column 1, line 1, 

Please replace "SYSTEM AND METHOD FOR HANDLING INSTRUCTIONS 
OCCURRING AFTER AN ISYNC INSTRUCTION" with ~ SYSTEM FOR 
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ISYNC BARREER INSTRUCTION IN PROGRAM ORDER --. 
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